Should Non-Sensitive Attributes be Masked? Data Quality Implications of Data Perturbation in Regression Analysis

نویسنده

  • Sumitra Mukherjee
چکیده

Ensuring the security of sensitive data is an increasingly important challenge for information systems managers. A widely used technique to protect sensitive data is to mask the data by adding zero mean noise. Noise addition affects the quality of data available for legitimate statistical use. This article develops a framework that may be used to analyze the implications of additive noise data masking on data quality when the data is used for regression analysis. The framework is used to investigate whether noise should be added to non-sensitive attributes when only a subset of attributes in the database are considered sensitive, an issue that has not been addressed in the literature. Our analysis indicates that adding noise to all the attributes is preferable to the existing practice of masking only the subset of sensitive

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Faults and fractures detection in 2D seismic data based on principal component analysis

Various approached have been introduced to extract as much as information form seismic image for any specific reservoir or geological study. Modeling of faults and fractures are among the most attracted objects for interpretation in geological study on seismic images that several strategies have been presented for this specific purpose. In this study, we have presented a modified approach of ap...

متن کامل

A Reliable Routing Algorithm for Delay Sensitive Data in Body Area Networks

Wireless body Area networks (WBANs) include a number of sensor nodes placed inside or on the human body to improve patient health and quality of life. Ensuring the transfer and receipt of data in sensitive data is a very important issue. Routing algorithms should support a variety of service quality such as reliability and delay in sending and receiving data. Loss of information or excessive da...

متن کامل

Predicting of the Quality Attributes of Orange Fruit Using Hyperspectral Images

Background: Hyperspectral image analysis is a fast and non-destructive technique that is being used to measure quality attributes of food products. This research investigated the feasibility of predicting internal quality attributes, such as Total Soluble Solids (TSS), pH, Titratable Acidity (TA), and maturity index (TSS/TA); and external quality attributes such as color components (L*, a*, b*)...

متن کامل

مقایسه ابعاد عملکرد خانواده و کیفیت زندگی و رابطه این متغیرها در بین افراد معتاد و غیر معتاد

Introduction: The aim of this study was the comparison of family functioning and quality of life and their relationships among addicted and non-addicted persons. Method: The research method of study was ex-post factor. The sample of study was 107 addicts and 107 non-addicts. Sampling of addicts was clustering random sampling. Non-addicts were matched in terms of demographical characteristics an...

متن کامل

Global Measures of Data Utility for Microdata Masked for Disclosure Limitation

When releasing microdata to the public, data disseminators typically alter the original data to protect the confidentiality of database subjects’ identities and sensitive attributes. However, such alteration negatively impacts the utility (quality) of the released data. In this paper, we present quantitative measures of data utility for masked microdata, with the aim of improving disseminators’...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998